Rare-event induced binding transition of heteropolymers 
Sequence heterogeneity broadens the binding transition of a polymer onto a linear or planar 
substrate. This effect is analyzed in a real-space renormalization group scheme designed to capture 
the statistics of rare events. In the strongly disordered regime, binding initiates at an exponentially 
rare set of "good contacts". Renormalization of the contact potential yields a Kosterlitz-Thouless 
type transition in any dimension. This and other predictions are confirmed by extensive numerical 
simulations of a directed polymer interacting with a columnar defect. 
The binding transition of a polymer onto another extended
object, such as a second polymer, a membrane, or an
interface, is of interest in a variety of physical
circumstances. The problem has been studied in the context
of wetting in two dimensions, the depinning of a flux line
from a columnar defect in type-II superconductors, the
denaturation of double-stranded DNA molecules in solution,
and the localization of a copolymer at a two-fluid
interface. In surface physics, the formation of facets on
vicinal surfaces can be viewed as a bound-state problem
involving surface steps.

The mathematical framework for the homopolymer 
binding transition is well-established. The binding tran- 
sition of a heteropolymer (i.e., a disordered sequence of 
two or more letters) is expected to be broadened by fluc- 
tuations in the monomer contact energy with the target 
object. But a quantitative characterization of the broad- 
ening effect, particularly when the disorder fluctuations 
are strong, is still largely missing. At present, the only 
settled case is that of a directed heteropolymer inter- 
acting with a columnar defect, where weak variations in 
the contact energy have been shown to have no effect 
on the transition, provided the transverse dimension d 
is between one and three. For d = 1, Forgacs et
al., using a field-theoretic method, found that weak
disorder introduces only subleading corrections to the
singular part of the free energy and does not smear out
the specific heat jump at the transition. This conclusion
was put into doubt by Derrida et al., who performed
a perturbative renormalization group (RG) analysis and
numerical calculations. The problem has since been
re-examined with conflicting results.

In this Letter we report a novel RG approach to the 
heteropolymer binding, focusing on the statistics of rare 
attractive segments that initiate the transition. The
frequency of these "good contacts", in the case of a random
sequence, is described by an exponential tail in the
distribution of the contact (free) energy whose decay rate
q depends on temperature. When q reaches a critical
value, the energy gain at these segments is sufficient to
offset the logarithmic entropy cost to form a bound phase.
This offers an extremely robust and generic mechanism 
of heteropolymer binding transition at strong disorder in 
any dimension. The critical behavior around the tran- 
sition is analyzed in a real-space RG calculation. Com- 
bined with extensive simulation results on the directed 
random heteropolymer, we conclude that, in the disor- 
der dominated regime, the binding transition becomes 
infinite order, bearing many of the characteristics of the 
Kosterlitz-Thouless-Berezinskii (KTB) transition in the 
two-dimensional XY model.

To set the scene, let us recall the directed polymer 
(DP) problem on a two-dimensional square lattice (r, t) 
with a columnar defect at r = 0. For simplicity, we
restrict the transverse displacement of the DP at each step
$\Delta t = 1$ to three possible values, $\Delta r = -1, 0, 1$.
A step on the defect at r = 0 picks up an energy $\eta(t)$ which
depends on the location of the contact (or monomer index)
t. In the case of a random heteropolymer, the contact
energies $\eta(t)$ are taken to be identically distributed and
independent of each other.
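The lattice model just described can be evaluated by a transfer matrix. The following is a minimal sketch, not the authors' code: the function name `partition_ratio`, the periodic transverse boundary, the lattice `width`, and the division by 3 at each step (so that a free polymer has total weight 1) are all illustrative assumptions.

```python
import math

def partition_ratio(eta, T, width=257):
    """Return Z(L)/Z_0(L) for a directed walk with a defect line at r = 0.

    eta[t] is the contact energy picked up at step t+1 when the walk sits
    on the defect. The walk starts at (0, 0), takes steps dr in {-1, 0, 1},
    and may end anywhere. Index 0 of the list is r = 0 (periodic in r).
    """
    z = [0.0] * width
    z[0] = 1.0
    for e in eta:
        # One step: sum over dr = -1, 0, +1, normalised by the 3 choices.
        z = [(z[r - 1] + z[r] + z[(r + 1) % width]) / 3.0 for r in range(width)]
        # Boltzmann weight of a contact on the defect.
        z[0] *= math.exp(-e / T)
    return sum(z)
```

With all contact energies set to zero the ratio stays at 1, an attractive defect raises it above 1, and a repulsive defect lowers it. The width should exceed twice the polymer length if wrap-around paths are to be excluded.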

The above model is easily generalized to the case of a 
directed walk on a (d+ l)-dimensional hypercubic lattice, 
interacting with a line defect. The role of extra transverse 
dimensions is best appreciated in the "necklace"
representation, where the partition sum of a DP of length
L is expressed as

\[
  Z(L) = \sum_{0 < t_1 < t_2 < \cdots < t_n < L} \exp[-H(\{t_i\})/T].  \tag{1}
\]



Here $t_i$ is the monomer index of the $i$th contact, and

\[
  H(\{t_i\}) = \sum_i \eta(t_i) + \sum_i u(t_{i+1} - t_i).  \tag{2}
\]



The "pair potential" u(t) is obtained by summing over 
thermal paths in the bulk connecting two successive con- 
tacts separated by a distance t. It has the generic form, 

\[
  u(t)/T = f_0 t + \beta(d) \ln t + \mu + O(t^{-1}),  \tag{3}
\]

where $f_0$ is the reduced free energy per unit length in the
bulk, $\beta(d) = 1 + \frac{1}{2}|d - 2|$ is a universal exponent
characterizing the first return probability of a random walker in
d dimensions, and $\mu$ is a model-specific parameter.

As emphasized by Fisher, the contact point
representation (1) unifies a large class of polymer binding
problems including, under certain approximations,
polymers that are self-avoiding or interacting with a surface.
The linear term in t in (3) drops out when the ratio
$Z(L)/Z_0(L)$ is considered, where $Z_0(L) = \exp(-f_0 L)$ is
the partition function of a free polymer. The dominant
interaction at large distances is given by the logarithmic
term. Since the density of contacts vanishes at a
continuous transition, the critical properties are governed by
the value of $\beta$, which serves to define universality classes.
For this class of models, it suffices to consider the DP
problem at d < 2, which is the case we focus on below.

Previous analytical studies, based on a perturbative 
treatment of the contact potential, ran into difficulties 
when disorder becomes relevant. To elucidate
how strong disorder modifies the transition, we examine
first an extreme case where the contact potential is
infinite everywhere except on a small number of randomly
distributed sites of density A. The reduced contact
potential $x_i = \eta(t_i)/T$ on these sites is assumed to
be negative and exponentially distributed with a
probability density function (PDF) $p(x) = q \exp(qx)$, where
q is the decay rate. For q < 1, the one-contact partition
sum $z = \sum_{1 \le i \le L} \exp(-x_i)$ of a polymer of length
$L \gg 1/A$ is dominated by the strongest binding site (i.e.,
it is glassy). The typical value of the potential on this site is
$x_m \simeq -q^{-1} \ln(LA)$. The excess reduced free energy of a
single contact is then estimated to be



\[
  \Delta F/T \simeq x_m + \beta \ln L = (\beta - q^{-1}) \ln L + \mathrm{const}.  \tag{4}
\]



Hence, for $q < q_c = 1/\beta$, a sufficiently long polymer will
make use of the rare occurrence of a relatively large binding
energy (of order ln L) on a single monomer to form the
bound phase. By setting $\Delta F \simeq T$, we estimate a
"localization length" $\xi_\parallel \sim \exp(\mathrm{const}/|q - q_c|)$.
Polymers of length $L < \xi_\parallel$ are typically free, while
those with $L > \xi_\parallel$ are typically bound.
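The extreme-value estimate behind this argument is easy to check numerically. The sketch below is illustrative only: the helper names `strongest_site` and `excess_free_energy`, the seed, and the parameter values are assumptions. It samples site potentials from $p(x) = q \exp(qx)$ and evaluates the single-contact estimate $(\beta - q^{-1}) \ln L$ plus a constant.

```python
import math
import random

def strongest_site(n_sites, q, rng):
    """Most negative of n_sites potentials drawn from p(x) = q*exp(q*x), x < 0."""
    # -x is exponentially distributed with rate q.
    return -max(rng.expovariate(q) for _ in range(n_sites))

def excess_free_energy(L, q, beta, A):
    """Single-contact estimate Delta F / T = x_m + beta*ln(L), x_m = -ln(L*A)/q."""
    return -math.log(L * A) / q + beta * math.log(L)

rng = random.Random(0)          # fixed seed, for reproducibility only
q, n_sites = 0.5, 2000          # n_sites plays the role of L*A
mean_xm = sum(strongest_site(n_sites, q, rng) for _ in range(200)) / 200
print(mean_xm, -math.log(n_sites) / q)   # sampled mean vs. typical-value estimate
```

The sampled mean lies somewhat below $-q^{-1} \ln(LA)$ because the mean of an extreme value exceeds its typical value by a constant of order $1/q$. This does not affect the coefficient of ln L that decides between the free and the bound regime.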

Although real polymers seldom (if at all) qualify for the 
above description at the monomer scale, it turns out that 
the exponential tail can be self-generated under
coarse-graining. This should come as no surprise, given
the following. (i) For a random heteropolymer,
a sequence of consecutive negative $\eta$'s appears
spontaneously. The binding energy is proportional to the
sequence length n, which follows an exponential
distribution. (ii) Above the transition temperature $T_c$,
the moments $\langle Z^q \rangle / Z_0^q$
diverge with polymer length for q > q(T), while those
with q < q(T) remain finite. This implies that the PDF
of Z has a power-law tail $Z^{-q(T)-1}$ towards large
values. Equivalently, the PDF of the reduced free energy,
$F/T = -\ln Z$, decays exponentially at rate q(T).

The task now is to devise a suitable framework to track
the renormalization of the contact energy distribution
due to multiple contacts. We have achieved
this with a real-space RG scheme. The main ideas
are outlined below. The contact point representation (1)
defines a one-dimensional lattice gas model. Consider a
segment of b sites along the chain which is mapped to
a single site under a block transformation. The site is
occupied (i.e., contains a contact) if at least one of the
b sites on the segment is occupied, and empty otherwise.
In the former case, the Boltzmann weight of a contact,
$w_i = \exp[-\eta(t_i)/T]$, transforms as



\[
  w = \prod_{i=1}^{b} (1 + w_i) - 1.  \tag{5}
\]



Rescaling $t \to bt$ reduces w by a factor $b^{\beta}$ due to the
logarithmic interaction. Hence the following iterative RG
equation is seen to hold,

\[
  1 + w = 1 + \frac{1}{b^{\beta}} \left[ \prod_{i=1}^{b} (1 + w_i) - 1 \right].  \tag{6}
\]
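The block transformation can be written as a short function. This is a sketch under the assumption that the renormalized weight is $w = b^{-\beta} [\prod_i (1 + w_i) - 1]$, which is the form consistent with the function $\psi(x)$ quoted further on; the name `block_transform` is illustrative.

```python
import math

def block_transform(weights, beta):
    """Map the contact weights w_i of one block of b sites to a single weight."""
    b = len(weights)
    # prod(1 + w_i) sums over all occupied subsets of the block, plus 1 for "empty".
    prod = math.prod(1.0 + w for w in weights)
    # Rescaling t -> b*t costs a factor b**(-beta) from the logarithmic interaction.
    return b ** (-beta) * (prod - 1.0)

beta = 1.5
# Uniform (homopolymer) fixed point for b = 2: w = 2**(-beta) * (2*w + w**2).
w_star = 2.0 ** beta - 2.0
print(block_transform([w_star, w_star], beta), w_star)
```

For uniform weights below `w_star` the weight shrinks under iteration (unbound phase), and above `w_star` it grows (bound phase).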



For b = 2, Eq. (6) coincides with the iterative
equation for the partition function Z = 1 + w on the Berker
lattice given by Derrida et al. A full treatment of the
problem can be formulated in terms of the PDF P(x) of
the reduced contact free energy $x = -\ln(1 + w)$. From
the distribution of the sum $y = \sum_i x_i$,

\[
  Q(y) = \int dx_1 \cdots dx_b \, \delta\Big( y - \sum_{i=1}^{b} x_i \Big) \prod_{j=1}^{b} P(x_j),  \tag{7a}
\]

we obtain the renormalized distribution $P'(x)$ of $x = -\ln(1 + w)$,

\[
  P'(x) = Q(\psi(x)) \, \frac{d\psi}{dx}.  \tag{7b}
\]

Here $\psi(x) = x - \beta \ln b - \ln[1 + (b^{-\beta} - 1) \exp(x)]$ is
obtained from (6) by solving for y.
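One way to iterate the distribution numerically is to represent P(x) by a finite sample and apply the block map to randomly drawn groups of b members. The sketch below takes that route as an assumption: this "pool" method is a common substitute for iterating the integral equations directly and is not necessarily what was used for the results reported here. The names `renormalize`, `psi` and `iterate_pool` are illustrative.

```python
import math
import random

def renormalize(y, b, beta):
    """Coarse-grained x from the block sum y: exp(-x) = 1 + b**(-beta)*(exp(-y) - 1)."""
    return -math.log1p(b ** (-beta) * math.expm1(-y))

def psi(x, b, beta):
    """Inverse map: the block sum y that yields x."""
    return x - beta * math.log(b) - math.log(1.0 + (b ** (-beta) - 1.0) * math.exp(x))

def iterate_pool(pool, b, beta, rng):
    """One RG step on a finite sample (pool) of x values representing P(x)."""
    return [renormalize(sum(rng.choice(pool) for _ in range(b)), b, beta)
            for _ in range(len(pool))]

x = renormalize(-3.0, 2, 1.5)
print(x, psi(x, 2, 1.5))   # psi recovers the block sum y = -3
```

The use of `log1p` and `expm1` keeps the map accurate when x is close to zero, which is the regime of a nearly repulsive contact.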

In the limit $b \to 1$, Eqs. (7) define an RG flow in the
space of distributions. Since Q(x) is a convolution of
P(x), it is convenient to introduce the Laplace transform,

\[
  P(s) = \int_{-\infty}^{0} dx \, e^{-sx} P(x).
\]

Collecting terms to first order in $dl = \ln b$, we obtain

\[
  \frac{dP(s)}{dl} = P(s) \ln P(s) - \beta s \, [P(s) - P(s-1)].  \tag{8}
\]



Consider first the homopolymer problem described by
$P(s) = \exp(-as)$. The parameter a, which defines an
effective free energy on a given scale, satisfies

\[
  da/dl = a + \beta (1 - e^{a}).  \tag{9}
\]

The stable fixed point at a = 0 corresponds to the
unbound phase where the contact potential renormalizes
to infinity. Transition to the bound phase occurs at
$a = a_c < 0$.
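The unstable fixed point $a_c$ of the homopolymer flow $da/dl = a + \beta (1 - e^{a})$ has no closed form but is easily located numerically. A minimal bisection sketch follows; the function names and the bracketing interval are illustrative choices, and $\beta > 1$ is assumed.

```python
import math

def flow(a, beta):
    """Right-hand side of da/dl = a + beta * (1 - exp(a))."""
    return a + beta * (1.0 - math.exp(a))

def critical_a(beta, lo=-50.0, hi=-1e-6, tol=1e-12):
    """Bisection for the nontrivial zero a_c < 0 (assumes beta > 1)."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if flow(mid, beta) < 0.0:
            lo = mid      # flow is negative to the left of a_c
        else:
            hi = mid      # flow is positive between a_c and 0
    return 0.5 * (lo + hi)

print(critical_a(1.5))
```

For a between $a_c$ and 0 the flow is positive and a runs to the stable fixed point a = 0 (unbound); for a below $a_c$ it is negative and a runs away to large negative values (bound).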

The PDF $P(x) = A q \exp(qx) + (1 - A) \delta(x)$ considered
in the single-contact model above corresponds to

\[
  P(s) = \begin{cases} 1 + A s/(q - s), & s < q; \\ \infty, & s > q. \end{cases}  \tag{10}
\]
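The intermediate step behind this Laplace transform is short. It is written out here for convenience, assuming the integration range $x \le 0$ with the full weight of the delta function at x = 0 included:

```latex
\begin{align}
P(s) &= (1 - A) + A q \int_{-\infty}^{0} dx \, e^{(q - s) x} \\
     &= 1 - A + \frac{A q}{q - s}
      = 1 + \frac{A s}{q - s}, \qquad s < q .
\end{align}
```

For s > q the integrand grows as $x \to -\infty$ and the integral diverges, which gives the second branch.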



Setting A = 0, we obtain a family of fixed-point functions
of (8), parametrized by q. Writing $P(s) = \langle Z^s \rangle$, where
Z = 1 + w represents a certain restricted partition
function, we see that these fixed points correctly describe the
behavior of the system above the binding transition.

It can be shown that, within the class of functions that
have a power-law singularity at s = q, (10) is the only
one that is renormalizable. We have derived a
two-parameter RG flow equation from (8) by matching terms
that diverge as $s \to q$. For $A \ll 1$, the result reads

\[
  dq/dl = -\tfrac{1}{2} q A,  \tag{11a}
\]
\[
  dA/dl = (1 - q\beta) A + \frac{2 - q}{2q} A^2.  \tag{11b}
\]


The RG flow obtained by integrating (11) at $\beta = 3/2$
is illustrated in Fig. 1(a), where q is plotted against
$A^{1/2}$ in analogy with the KTB transition. The critical
manifold ($T = T_c$) is indicated by the thick line.
For $T > T_c$, the flow ends on the A = 0 axis at a
$q > q_c = 1/\beta$. In contrast, below but close to $T_c$, A
first decreases, reaches a minimum value, and then
increases again. Integrating the flow equation near $q = q_c$
and A = 0 yields an exponentially diverging correlation
length $\xi_\parallel \simeq \xi_0 \exp(a |T - T_c|^{-1/2})$. As in the KTB
transition, the free energy per unit length has an essential
singularity when approaching $T_c$ from below,

\[
  f_s \simeq T/\xi_\parallel = (T/\xi_0) \exp(-a |T - T_c|^{-1/2}).  \tag{12}
\]
FIG. 1. RG flow from Eqs. (11) (a) and from full iteration
of the distribution on the Berker lattice (b) (see text).
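The qualitative shape of the two-parameter flow can be reproduced with a simple Euler integration. The sketch below rests on assumptions: the flow is taken as $dq/dl = -\frac{1}{2} q A$ and $dA/dl = (1 - q\beta) A + \frac{2 - q}{2q} A^2$, where the coefficient 1/2 in the first equation is a reading of a poorly legible source and should be treated as such. The function name, the step size and the cutoff `A_stop`, beyond which the small-A expansion is no longer trusted, are illustrative.

```python
def integrate_flow(q, A, beta, dl=1e-3, l_max=200.0, A_stop=0.5):
    """Euler integration of the flow; stops once A leaves the small-A regime."""
    l = 0.0
    while l < l_max and A < A_stop:
        dq = -0.5 * q * A                                  # assumed coefficient 1/2
        dA = (1.0 - q * beta) * A + (2.0 - q) / (2.0 * q) * A * A
        q, A, l = q + dl * dq, A + dl * dA, l + dl
    return q, A, l

# Starting above q_c = 1/beta: A decays and q settles at a value above q_c.
print(integrate_flow(0.9, 0.01, 1.5))
# Starting below q_c: A grows until the cutoff is reached at a finite scale l.
print(integrate_flow(0.6, 0.01, 1.5))
```

The scale l at which A reaches the cutoff serves as a rough proxy for $\ln \xi_\parallel$ on the bound side.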

We have checked the above predictions against (i) full
iteration of Eqs. (7) at b = 2 and $\beta = 3/2$ (Berker lattice
with a branching ratio $n = 2\sqrt{2}$), and (ii) transfer matrix
calculation in (1+1) dimensions. In both cases, the contact
energy is taken to be $\eta = u_0 + 2g^2 + 2g\epsilon$, where $u_0$ is
the strength of the contact potential at the homopolymer
binding transition, and $\epsilon$ is a random variable drawn from
the standard normal distribution. With this choice, the transition
temperature of the annealed problem (as defined by
$F_a \equiv -T \ln \langle Z \rangle$) is fixed at $T_a = 1$. The strength of the
disorder can be tuned by varying g.
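That this parametrization pins the annealed transition at T = 1 follows from the Gaussian identity $\langle e^{c\epsilon} \rangle = e^{c^2/2}$, which gives $-T \ln \langle e^{-\eta/T} \rangle = u_0 + 2g^2 (1 - 1/T)$. The sketch below checks this against a Monte Carlo average; it assumes a unit-variance normal variable, and the function names, the seed and the values of $u_0$ and g are illustrative.

```python
import math
import random

def contact_energy(u0, g, rng):
    """Draw eta = u0 + 2 g^2 + 2 g eps, with eps a standard normal variable."""
    return u0 + 2.0 * g * g + 2.0 * g * rng.gauss(0.0, 1.0)

def annealed_energy(T, u0, g):
    """-T ln <exp(-eta/T)>, using <exp(c*eps)> = exp(c^2 / 2)."""
    return u0 + 2.0 * g * g * (1.0 - 1.0 / T)

rng = random.Random(0)           # fixed seed, for reproducibility only
u0, g, n = -0.5, 0.5, 100000     # illustrative values
mc = -math.log(sum(math.exp(-contact_energy(u0, g, rng)) for _ in range(n)) / n)
print(mc, annealed_energy(1.0, u0, g))   # both close to u0 at T = 1
```

At T = 1 the annealed contact energy equals $u_0$ for any g, so varying g changes the disorder strength without moving $T_a$.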

To probe the effective contact potential on a given scale
L on the square lattice, we have computed the partition
sum Z(L) over all directed paths that start at (0, 0)
and end at any point on the t = L line. The quantity
$x(L) \equiv -\ln[Z(2L)/Z(L)]$, computed for a given realization
of the disorder, yields a single-scale contact potential.
Figure 2(a) shows the integrated distribution
$\Pi(x)$ for two temperatures at g = 2 and a sample size of
$10^5$. The exponential tail quickly develops as L increases.
At $T = T_a$ (top), the tail decays as exp(x). The
pre-exponential amplitude decreases as a power of the length.
At $T = T_c$ (bottom), the decay rate of the tail approaches
the critical value $q_c = 2/3$, while the amplitude changes
much more slowly than at higher temperatures. In
both cases, the typical value of x(L) approaches $\frac{1}{2} \ln 2$
(dotted lines), as expected for a uniform repulsive potential.
For comparison, the integrated distribution of $x = -\ln Z$
on the Berker lattice, obtained through numerical iteration
of (7), is shown in Fig. 2(b). A similar trend
is observed, though the data span a much larger
range and are much less noisy.
FIG. 2. Evolution of the integrated PDF $\Pi(x)$ as L
increases in powers of 4 (in the direction of the thick arrows).
(a) Square lattice with periodic boundary conditions and a
lateral size of 2 x 1024 sites (g = 2, $10^5$ samples). (b) Berker
lattice at $n = 2\sqrt{2}$ and g = 1.
Using the Berker lattice data, we have fitted the tail of
$\Pi(x)$ to the exponential form $A \exp(qx)$. The evolution of
A(L) and q(L) in the critical region is shown in Fig. 1(b).
Each data set corresponds to an increasing series of L in
powers of four (from right to left) at a given temperature.
The resemblance to the RG flow diagram in Fig. 1(a) is
apparent. Therefore the two-parameter flow equations
(11), although only approximate by construction, correctly
capture the asymptotic renormalization of the tail.
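A tail fit of this kind can be sketched as a linear regression of $\ln \Pi(x)$ against x. The example below is illustrative only: it uses synthetic samples drawn from $P(x) = A q \exp(qx) + (1 - A) \delta(x)$ rather than Berker lattice data, and the function name `fit_tail`, the cutoff `x_max`, the seed and the parameter values are assumptions.

```python
import math
import random

def fit_tail(samples, x_max):
    """Least-squares fit of ln Pi(x) = ln A + q*x on the tail x < x_max.

    Pi(x) is estimated as the fraction of samples at or below x.
    Returns the pair (A, q).
    """
    n = len(samples)
    xs = sorted(s for s in samples if s < x_max)
    ys = [math.log((i + 1) / n) for i in range(len(xs))]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    q = sxy / sxx
    return math.exp(my - q * mx), q

rng = random.Random(1)   # fixed seed, for reproducibility only
# Synthetic data: with probability A = 0.2 an exponential tail of rate q = 0.7, else x = 0.
samples = [-rng.expovariate(0.7) if rng.random() < 0.2 else 0.0 for _ in range(50000)]
A_fit, q_fit = fit_tail(samples, x_max=-0.5)
print(A_fit, q_fit)
```

An unweighted fit gives the few points in the extreme tail the same weight as the well-sampled ones. For noisy data a weighted fit or a restricted fitting window may be preferable.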



FIG. 3. Transfer matrix results for the square lattice. (a)
Mean free energy $-\langle \ln Z \rangle$ as a function of L for g = 3 and
T = 0.15, 0.25, 0.35, 0.45, 0.55 and 0.65 (from bottom to top).
(b) Variation of $\xi_\parallel$ and f with T for g = 1.5, 2, 2.5 and 3 (from
right to left).

On the square lattice, the localization length $\xi_\parallel$ in
the bound phase can be determined from the mean free
energy $-\langle \ln Z(L) \rangle$, which goes through a maximum at
$L \simeq \xi_\parallel$, as shown in Fig. 3(a). For a polymer of length
$L < \xi_\parallel$, $-\langle \ln Z \rangle$ increases with L and approaches the
asymptotic behavior $\frac{1}{2} \ln L$ (dashed line) of a purely
repulsive line. Thus, even at $T < T_c$, a sufficiently short
polymer (or a short segment of a long polymer) is
typically repelled by the target object. The attraction, due to
rare events, is felt (in a typical realization of the disorder)
only when $L > \xi_\parallel$. The asymptotic behavior at $L \gg \xi_\parallel$
is given by $\langle \ln Z(L) \rangle \simeq -fL$, where f(T) is the reduced
free energy per unit length in the thermodynamic limit.
The variation of $\xi_\parallel$ and f with T, as shown in Fig. 3(b), is
in agreement with Eq. (12). Extrapolating the curves to
the horizontal axis (as indicated by the dashed lines), we
obtain $T_c/T_a = 0.96$, 0.9, 0.85, and 0.8 for g = 1.5, 2, 2.5,
and 3, respectively. These values agree with the analysis
based on the evolution of the tails of the distribution.
Details of our study will be published elsewhere.

In summary, we propose that the random heteropoly- 
mer binding transition at strong disorder is induced by 
rare fluctuations of the contact potential. A real-space 
RG scheme is presented to analyze the evolution of the 
PDF of the contact potential under successive block 
transformations. Within a two-parameter Ansatz for the
tail of the distribution, an RG flow similar to that of the
KTB transition is derived. The analysis yields an ex- 
ponentially diverging correlation length and an essential 
singularity for the free energy as T c is approached from 
the bound side. These predictions, which hold in all di- 
mensions when disorder is relevant, are confirmed by ex- 
tensive numerical calculations on the Berker lattice and 
on the square lattice in (1+1) dimensions. 

Finally, we comment briefly on the relationship between
our work and previous numerical studies
of the DP binding transition on the square lattice, which
were done on binary disorder with $\eta = p \pm \Delta$. By looking
at the evolution of P(x) at $T = T_a$, we find that, even
for $\Delta = \infty$ (while keeping $p - \Delta$ finite), the tail develops
at a much slower rate than for Gaussian disorder. Since
d = 1 is marginal with regard to the relevance of weak
disorder, we expect the results reported previously to be
influenced, to a lesser or greater extent, by crossover
effects. Indeed, the transition temperatures $T_c$ determined
in these studies, using methods different from ours, are
either indistinguishable from or only slightly below $T_a$ of the
corresponding annealed problem. We leave a detailed
discussion of this and other issues to a future publication.